RelationTrack: Relation-aware Multiple Object Tracking with Decoupled Representation

نویسندگان

چکیده

Existing online multiple object tracking (MOT) algorithms often consist of two subtasks, detection and re-identification (ReID). In order to enhance the inference speed reduce complexity, current methods commonly integrate these double subtasks into a unified framework. Nevertheless, ReID demand diverse features. This issue results in an optimization contradiction during training procedure. With target alleviating this contradiction, we devise module named Global Context Disentangling (GCD) that decouples learned representation detection-specific ReID-specific embeddings. As such, provides implicit manner balance different requirements subtasks. Moreover, observe preceding MOT typically leverage local information associate detected targets neglect consider global semantic relation. To resolve limitation, develop module, referred as Guided Transformer Encoder (GTE), by combining powerful reasoning ability encoder deformable attention. Unlike previous works, GTE avoids analyzing all pixels only attends capture relation between query nodes few self-adaptively selected key samples. Therefore, it is computationally efficient. Extensive experiments have been conducted on MOT16, MOT17 MOT20 benchmarks demonstrate superiority proposed framework, namely RelationTrack. The experimental indicate RelationTrack has surpassed significantly established new state-of-the-art performance, e.g., IDF1 70.5% MOTA 67.2% MOT20.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multiple object tracking with context awareness

Multiple people tracking is a key problem for many applications such as surveillance, animation or car navigation, and a key input for tasks such as activity recognition. In crowded environments occlusions and false detections are common, and although there have been substantial advances in recent years, tracking is still a challenging task. Tracking is typically divided into two steps: detecti...

متن کامل

Object Tracking in Multiple Cameras with Disjoint

The specific problem we address in this chapter is object tracking over wide-areas such as an airport, the downtown of a large city or any large public area. Surveillance over these areas consists of the search for suspicious behavior such as persons loitering, unauthorized access, or persons attempting to enter a restricted zone. Until now, these surveillance tasks have been executed by human ...

متن کامل

Cooperative Object Tracking with Multiple PTZ Cameras

Research in visual surveillance systems is shifting from using few stationary, passive cameras to employing large heterogeneous sensor networks. One promising type of sensor in particular is the Pan-Tilt-Zoom (PTZ) camera, which can cover a potentially much larger area than passive cameras, and can obtain much higher resolution imagery through zoom capacity. In this paper, a system that can tra...

متن کامل

Energy-Aware Object Tracking Sensor Networks

In order to fully realize the potential of sensor networks , energy awareness should be incorporated into every stage of the network design and operation. As a representative application of sensor networks, object tracking sensor network faces many system and design challenges in terms of energy management. By studying the role of semantic location in object tracking queries, data dissemination...

متن کامل

Context - aware and Attentional Visual Object Tracking

Context-aware and Attentional Visual Object Tracking

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Multimedia

سال: 2022

ISSN: ['1520-9210', '1941-0077']

DOI: https://doi.org/10.1109/tmm.2022.3150169